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Abstract 

The study of asymptotic properties of the conjugacy class of a random element of the fi- 
nite affine group leads one to define a probability measure on the set of all partitions of all 
positive integers. Four different probabilistic understandings of this measure are given-three 
using symmetric function theory and one using Markov chains. This leads to non-trivial enu- 
merative results. Cycle index generating functions are derived and are used to compute the 
large dimension limiting probabilities that an element of the afhne group is separable, cyclic, 
or semisimple and to study the convergence to these limits. The semisimple limit involves both 
Rogers- Ramanuj an identities. This yields the first examples of such computations for a maximal 
parabolic subgroup of a finite classical group. 

Key words: Conjugacy class, classical group, affine group, Hall-Littlewood polynomial, symmetric 

function. 

1 Introduction 

The conjugacy classes of the unitary groups with complex entries are simply the set of its eigenval- 
ues. Thus the enormous body of recent work on eigenvalues of random matrices is to a large extent 



a study of conjugacy classes. Given the power of random matrix models (see for instance |KS1| 



and |KS2] which show the predictive power of random matrices for the study of local properties of 
the zeros of the Riemann zeta function), it is natural to study conjugacy in random matrices over 
finite fields as well. 



The papers ||F1| , ||F2| , Q, |lF3| give a purely probabilistic understanding of conjugacy classes 
in the finite classical groups and in the group of upper triangular matrices over a finite field. One 
outcome was a simple and motivated proof of the Rogers-Ramanujan identities WM- 

The current paper considers the affine group A{n, q) (all matrices in GL(n + 1, q) with xn = 1 



and Xji = for j > 2) and the maximal parabolic subgroup P{n, q) (all matrices in GL{n + 1, q) 
with xii 7^ and Xji = for j > 2). 

Section g derives an expression for the chance that an element of A{n, q) or P(n, q) has a given 
rational canonical form in GL{n + 1,^), giving rise to cycle index generating functions for these 
groups. Section y also makes useful contact with the Hall-Littlewood symmetric functions. 

Section y opens by defining a new and natural probability measure N^^q on the set of all 
partitions of all positive integers and by relating it to an analogous measure for the finite general 
linear groups. Then it gives four purely probabilistic algorithms for "growing" random partitions 
according to Nu^q and also an algorithm which arises when considering conjugacy classes only of 
unipotent elements in A{n,q) or P{n,q). 

Section ^ gives non-trivial enumerative applications of the algorithms of Section ^. For instance 
the n ^ oo chance that an element of A{n, q) or P{n, q) has a fixed space of a given dimension is 
shown to have a simple product formula, where each factor in the product has a clear probabilistic 
meaning. The number of unipotent elements in A{n, q) or P{n, q) of a given rank is computed. 

Section |5| of this paper uses the generating functions of Section |2| to calculate the probabilities 
that an element of A{n, q) or P(n, q) is separable, cyclic, or semisimple. Analogous results are 



known for the finite classical groups [F5],|W|,[ FNP ] and are important for computational group 



theory UNPU,|NP4]. The motivation for Section S is the fact that probabilistic estimates in maximal 



subgroups of the finite classical groups are also useful for group theory. To state a typical result, 
recall that a matrix is called cyclic if its minimal polynomial is equal to its characteristic polynomial. 
Whereas the n ^ oo limiting probability that an element of GL{n,q) is cyclic is equal to -, 7i / a ; 
Section 5 shows that the corresponding probability for A{n, q) or P{n, q) is i_ii„-ii/qi i+i/q^ ' which 
is asymptotically 1 — 1/q^ + 0{l/q^) rather than 1 — 1/q^ + 0(l/g^). It is worth emphasizing that 
at present even for GL{n, q) the only method for performing such exact calculations is through the 



use of generating functions. 

2 Cycle Indices, Partitions and Hall-Littlewood Polynomials 

To begin we recall some standard notation about partitions which will be used throughout the 
paper. Let A be a partition of some non-negative integer |A| into parts Ai > A2 > • • •• Let mj(A) 
be the number of parts of A of size i, and let A' be the partition dual to A in the sense that 
A^ = r7T,j(A) + mi+i(A) + • • •. Let n(A) be the quantity X]j>i(^ ~ l)Ai. It is also useful to define the 
diagram associated to A as having the j'th row consist of Xj boxes. For example the diagram of the 
partition (5441) is: 

D D D D D 
D D D D 
D D D D 

D 
The notation Px{xi,X2, • • ■ ;t) denotes the Hall-Littlewood symmetric function. The reader is re- 
ferred to Chapter 3 of [ Mac ] for a thorough discussion of its properties. The symbol (-)j will denote 



{l-l/q)---{l-l/q'). 

Next recall (e.g. Chapter 6 of [Q) that the conjugacy classes of GL{n,q) are parameterized 
by rational canonical form. This form corresponds to the following combinatorial data. To each 
monic non-constant irreducible polynomial </> over the field of q elements Fq, associate a partition 
(perhaps the trivial partition) A<^ of some non-negative integer |A(^|. Let deg{cp) denote the degree of 
(f). The only restrictions necessary for this data to represent a conjugacy class are that the partition 
corresponding to the polynomial (p(z) = z is empty and that J2(j, \''^(t>\deg{(p) = n. 

This conjugacy data arises explicitly as follows. Given an element a G GL{n, q) operating on 
the vector space V, there is a unique direct sum decomposition ^ = F^ where the characteristic 



polynomial of a on V^ is a power of (j) and the characteristic polynomials on any two summands 
are relatively prime. Furthermore a decomposes each V^ into a sum of cyclic subspaces. This 
decomposition need not be unique but the dimensions of the cyclic subspaces in the decomposition 
of V^ are unique and are the row lengths of the partition A,^. For example, the identity matrix 
has \z-i equal to (l*^) and an elementary reflection with a 7^ in the (1,2) position, ones on the 
diagonal and zeros elsewhere has \z-i equal to (2, 1"~^). As another example, the characteristic 
polynomial of an element with conjugacy class data {A,^} is JltAT^z (jy'^'t'K 

The first aim of this section is to find expressions for the chance that an element of A(n, q) 
or P(n, q) has given rational canonical form data. This is a cruder invariant than conjugacy in 
A{n, q) or P{n, q). However most conjugacy class functions on A{n, q) or P{n, q) of interest depend 
only on the Jordan form in GL{n + l,q). (A similar reduction proved useful |Q,|F3] for upper 



triangular matrices over a finite field, for which the theory of wild quivers implies that there is no 
finite parameterization of conjugacy classes). 

To find the chance that an element of A{n, q) or P{n, q) has a given rational canonical form we 



use a result of Nakada and Shinoda [NaS] on the parameterization and sizes of conjugacy classes in 
A{n,q) or P(n,q). Then a combinatorial reformulation followed by a summation gives the result 
we seek. 

For the statement of Theorem ll], we use the notation that |Zg(c)| is the size of the centralizer in 
the group G of any element in the conjugacy class c. The symbol F* denotes the non-zero elements 
in Fq. We use the convention that GL{0,q) consists of one conjugacy class of size one. 

Theorem 1 (j^a^) 

1. The conjugacy classes of A{n, q) are parameterized by pairs (c„+i_fc, k) where < k < n + 1 
and Cn+i-k is a conjugacy class of GL(n + 1 — k,q). Letting {A^} he the conjugacy class 
data corresponding to Cn+i~k, the GL{n + l,q) rational canonical form of the corresponding 
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class in A{n, q) is given by A^ = A^ for (j) ^ z — 1 and by letting \z~i = (k) U Xz-i be the 



partition formed by adding a row of length k to Xz-i- The centralizer size of the corresponding 
conjugacy class is 



2. The conjugacy classes of P{n,q) are parameterized by triples (c„+i_fc, A;,a) where < k < 
n + 1, a G F* and Cn+i^k is a conjugacy class of GL{n + 1 — k^q). Letting {A^} be the 
conjugacy class data corresponding to Cn+i-k, the GL{n+l,q) rational canonical form, of the 



corresponding class in A{n, q) is given by A^ = A^ for (p ^ z—a and by letting Xz~a = {k)yj\z-a 



be the partition formed by adding a row of length k to Xz-a- The centralizer size of the 
corresponding conjugacy class is 



Lemma || below was Theorem 9 in the thesis [FO]- Of course the formula for the conjugacy 



class sizes in GL{n,q) is not due to the author (see for instance page 219 of [Mac] for the third 
expression), but the combinatorial rewritings of it in Lemma || are very useful. 



Lemma 1 ([FCl) The conjugacy class of GL{n,q) corresponding to the data {A^} has size 

\GLin,q)\ 
Il<t,^zCGL,^,g{X^) 

where 



CGL,0,,(A) = g2rf'=3('^)E'.<. '^'"'.W-^ W+l E.(-l)-« W'l Y[ \GL{m,{X),q 



deg(4>) \ 



q' 



deg(<j,) )mi{\) 
.deg{(f>)n{\) 



P ( ^ 1 1 \ 



Theorem ^ is our first main result. The notation used in the proof is the same as in the statement 
of Theorem ||. Recah that \', ^ is the size of the first column of A^. 

Theorem 2 1. The number of elements of A{n,q) with GL{n + l,g) rational canonical form 
data {A^} is equal to 

|A(n,g)|((?^-i.i-l) 



2. The number of elements of P{n,q) with GL{n + l,g) rational canonical form data {A^} is 
equal to 

y |P(n,(?)|(/-«.i-l) 1 



Proof: First we consider A(n^q). From part 1 of Theorem |^ the number of elements of A(n,g) 
with GL{n + 1, q) rational canonical form data {A^} is 

y- \A{n,q)\ ^ 

(c„ +l-fc. fc) \^GL{n+l-k,q)\Cn+l-km ^'-^ ' ^i-fc 

From Lemma ||, |-^GL(n+i-A;,g)(cn+i-fc)| can be written as 

■^2 . 1 . 



11 11'' ^„deg(4>)'m,{\^y 

4>^z i " 

Thus the number of elements of A{n,q) with GL(n + l,q) rational canonical form data {A,^} is 



\A{n,q)\ Yl ^ ^ 



jk-l+2j2ti im.{A._i)+{2fc-l) J2tk ™«(^--i) J]. q(K-i,i^\l' 



. ]J ]J_ _J 

Consequently it is sufficient to prove that 



y = '- = = 

{qK-1,1 _ 1) 



^qJmi(Xz-i) 



Since only partitions corresponding to the polynomial z — 1 are involved in this last equation, we 
simplify notation by suppressing the dependence on z — 1 and by further replacing mj(A) and mj(A) 
by rrii and rrii. 

From Lemma ^ one sees that 



E 



fc>l,(fc)UA=A 

y I 

fc>l,(fc)UA=A 



Clearly if (k) U A = A then rrii = irii for i i^ k and m^ = m,}^ + 1. Elementary combinatorics 
then shows that this last expression is equal to 



y \ 

fc>l,(fc)UA=A 






fc>l,(fc)UA=A 

^ ^^ — 5 y (1 - i/g™^)/* 

1 x-^ , \' ),' 



q ^.<, ' . ^. . lli(^)m, fc>l 



K - A+i^ 



(g^^-l)- 



(9"^-l): 



.A' ^^ 1 



n.'?(^-)^(|u' 



where the final equahty is Lemma [l|. This completes the proof of part 1 of the theorem. Part 2 
follows from part 1 and Theorem |l|. □ 

Given Theorem pi it is now straightforward to write down the corresponding cycle index gener- 
ating functions. These will be applied in Section Isl. 

Corollary 1 Let x^^x be a collection of variables. Let {A^(a)} he the rational canonical form, data 
of any a € GL{n + l,q). Then 



Y. n ^0,A4a) 



A:|A|>0 -li*^ lgM,{A) 4,^z,z~l \ i-i-iH K qdea{4,) ) m^{\) 



2^ \P(„ „\\ 2^ 11 ^0,a^(q) 

n=0 I V'-jy;! aePin,q)^j^z 



aGF,* A: I A|>0 Vy J^Jlljy IqMi(A) .^^^^^-a A Hiy lgde9(« M^CA) 

3 A Measure on Partitions and Probabilistic Algorithms 

To begin we define a probability measure which will be the object of study in this section. Recall 
that n(A) is the quantity J2i>ii'^ ~ l)-^!- 

Definition: For < u < 1 and q > 1 the measure Nu^q on the set of all partitions of all positive 
integers is defined by 



Nu,qW = 11(1-4) ^ ^ 



.=1 ^ 5" 



li*^ ^r^ n(A) 
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The equivalence between the two definitions of Nu^q{X) fohows from Lemma |l]. 

Before proving that N^^q is indeed a probabihty measure, we recall the analogous measure Mu^q 



on the set of all partitions of all natural numbers. The survey [F3] summarizes what is known 
about Mu^q- 



M.„(A) ^ m - p^^ 



mi (A) 



oo 



r=l 



na-^)— ^^ '" 



Lemma § proves that N^^q is a probability measure. We include two proofs-one using an identity 
of Macdonald from symmetric function theory and another using the cycle index of A(n, q) and 
the fact that Mu^q is a probability measure for q > I and < u < 1. (Two other proof methods 
are to use either Steinberg's result that there are g" unipotent elements in A(n, q) together with 
the cycle index or else to use the Markov chain construction later in this section together with an 
identity of Cauchy) . 

Lemma 2 If q > 1 and < u < 1 then N^^q defines a probability measure. 

Proof: (First proof) Equation 3 on page 219 of [Mac] states that 



n(i + ^.y)/(i - ^.) = E *"^'^ It (1 + *'~'y)^A(^; t). 

i>l X j=l 

The result now follows by first setting Xj = u/q^,t = 1/q and taking coefficients of y on both sides, 
and then using the fact that P\{u/q,u/q^ , • • • ; 1/q) = w^\P\{\/q, l/g^, • • • ; 1/g). Q 

Proof: (Second proof) Setting all x^^x in the cycle index of A{n, q) equal to 1 gives the identity 

-| oo 1 oo -1 

r^^^ni^^^T^ E N^sW) j{ (IT 1 _ ^de,w / ..de,(0) ) E ^v^«(^) ,.-«(« w- 

r=l '^ A:1A|>0 <j>^z,z-l r=l '^ A 



11 



Since Mu^q is a probability measure it follows that 

1 °° 1 

(j)^zr=l 1^ A;|A|>0 

Setting all variables in the cycle index for GL{n, q) equal to 1 and using the fact that M„^q is a 
probability measure implies that 



1 "" 1 

ur 



I — U ^^ ^^ \ — udeg{(j>) /„r-deg{()>) 



(of course this can be proved directly). The lemma follows. □ 

We require an elementary lemma about Taylor series. For its statement [u"]/(u) denotes the 
coefficient of vP" in a polynomial f{u). 

Lemma 3 If f{l) < oo and the Taylor series of f around converges at u = 1, then 

I — u 
Proof: Write the Taylor expansion /(u) = Y.'^=q O'nu'^ ■ Then observe that [u"']^ = Y.7=o<^i- ^ 

Theorem ^ shows that the measure N^^q is a fundamental object for understanding the proba- 
bility theory of A(n, q). We remark that the idea behind it-auxiliary randomization-is a mainstay 
of statistical mechanics. 

Theorem 3 1. Fix u with < n < 1. Then choose a random number N with probability 
of getting n equal to (1 — u)u^ . Choose a uniformly in A{N,q). Then as (p varies any 
finite number of the random partitions X(f){a) are independent random variables, with Xz-i 
distributed according to the measure Nu^q and all other \^ distributed according to the measure 
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2. Choose a uniformly in A{n,q). Then as n ^> oo, any finite number of the random partitions 
X(f,{a) are independent random variables, with Xz-i distributed according to the measure Ni^g 
and all other X^ distributed according to the measure M^ „deg(4,) ■ 

3. Fix u with < u < 1. Then choose a random number N with probability of getting n equal to 
(1 — u)u^ . Choose a uniformly in P{N,q). Then as (p varies over polynomials of degree at 
least two, any finite number of the random partitions X^{a) are independent random variables 
distributed according to the measure M^deg(4>) „deg(4,) ■ The partitions Xz^a are independent of 
X(f) for deg{(f)) > 2 but depend on each other; any particular Xz-a has as its distribution the 
rmxture^+^S^^^. 

<j-l q-l 

4- Choose a uniformly in P{n,q). Then as n ^> oo and (p varies over polynomials of degree at 
least two, any finite number of the random partitions X(f){a) are independent random, variables 
distributed according to the measure M^ „deg(,i,) . The partitions X^-a are independent of X^ for 

deg{(j)) > 2 but depend on each other; any particular Xz-a has as its distribution the mixture 

TVi^ (£-2)Mi^ 
q-l "^ g-1 • 

Proof: As explained in the second proof of Lemma 0, we know that 



1 "" 1 

nn 

0^2 r=l 



I — U -I--I- -IJ- 1 _ udeg{(j>) ^qr-deg{(f>) ' 



Multiplying this by the cycle index of A{n, q) gives that 

°° (1 - u)n" 

^ T47 W ^ ]1^4>A4ci) = { Yl Xz-l,xNu,q{X)) Yi CYx4',^^u'''=sM,qdegw{X)). 

n=0 \^\'"-^1)\ aeA{n,q) <l>^z A:|A|>0 ^^z,z-l A 

This proves the first assertion of the theorem. For the second assertion use Lemma |^. The proofs 
of the third and fourth assertions are almost identical so are omitted. □ 

The remainder of this section considers probabilistic methods for growing random partitions 
according to the measure N^^q, analogous to those for M^^q in |F0|,|F1]. For this recall that a 
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standard Young tableau T of size n is a partition of n with each box containing one of {1, • • • , n} 
such that each of {1, • • • ,n} appears exactly once and the numbers increase in each row and column 
of T. For instance, 

S 
S 



[8] m 
is a standard Young tableau. 

Our first method for growing partitions according to the measure Nu^g is an algorithm we call 
the Affine Young Tableau Algorithm, so as to distinguish it from the Young Tableau Algorithm for 
growing partitions according to the measure M^^q- For its statement, we need the Young Tableau 



Algorithm |Fg,^| 



The Young Tableau Algorithm 



Step Start with A^ = 1 and A the empty partition. Also start with a collection of coins indexed 
by the natural numbers, such that coin i has probability 4 of heads and probability 1 — t- 
of tails. 



Step 1 Flip coin N. 

Step 2a If coin N comes up tails, leave A unchanged, set A^ = A^ + 1 and go to Step 1. 

Step 2b If coin N comes up heads, choose an integer 5 > according to the following rule. Set 

5 = 1 with probability ^ a/_-^ . Set S = s > 1 with probability 2 —erzi • Then 

increase the size of column s of A by 1 and go to Step 1. 

As an example of the Young Tableau Algorithm, suppose we are at Step 1 with A equal to the 
following partition: 

14 



n n n n 
n n 
n 

Suppose also that iV = 4 and that coin 4 had already come up heads once, at which time we added 
to column 1, giving A. Now we flip coin 4 again and get heads, going to Step 2b. We add to column 
1 with probability -^^—^ , to column 2 with probability C^f , to column 3 with probability %rE^ , to 
column 4 with probability 0, and to column 5 with probability ^-j^^. We then return to Step 1. 

Theorem 4 ( ^gj , ^^]) For < u < 1 and q > 1, the Young Tableau Algorithm generates parti- 
tions which are distributed according to the measure Mu^g. 

Next we point out that the Young Tableau Algorithm can easily be made to terminate. This 



idea was developed in joint work with Mark Huber surveyed in [F3] and goes as follows. Let a^ be 
the number of times that coin N comes up heads; the idea is simply to first determine the random 
vector (ai, a2, • • •) and then grow the partitions as in Step 2b of the Young Tableau Algorithm. So 
let us explain how to determine (ai,a2, • • •). For A^ > 1 let t^^' be the probability that all tosses 
of all coins numbered N or greater are tails. For A > 1 and j > let t- be the probability that 
some toss of a coin numbered A or greater is a head and that coin A comes up heads j times. It 
is simple to write down expressions for t^^\tQ ,t\ , ■ • • and clearly t^^' + J2j>o^ = 1- The 
basic operation a computer can perform is to produce a random variable U distributed uniformly 
in the interval [0, 1]. By dividing [0, 1] into intervals of length t^^', Vq ,t\ ,■ ■ ■ and seeing where U 
is located, one arrives at the value of ai. Furthermore, if U landed in the interval of length t^^' 
then all coins come up tails and the vector (ai, 02, • • •) is determined. Otherwise, move on to coin 
2, dividing [0, 1] into intervals of length t*-^-*, ig ; n > ' ' ' ^^^ so on. 
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Now we describe the AfHne Young Tableau Algorithm. 

The Affine Young Tableau Algorithm 
Step 1 Run the Young Tableau Algorithm so as to generate a partition A distributed as Mu^q- 
Step 2 Set S = 1 with probability —-r and S = s > 1 with probability —p r, — . Then increase 

q 1 g « q s-l 

the size of column s of A by 1. 

Theorem |6| shows that the Affine Young Tableau Algorithm grows partitions distributed as N^^q. 
For its proof, we use a different method for generating M^^q- Recall that the Young lattice is the 
set of all partitions of all natural numbers, with a directed edge drawn from the partition A to 
partition A if the diagram of A is contained in the diagram of A and |A| = |A| + 1. 

Theorem 5 (ff^) Put weights m\^\ on the edges of Young lattice according to the rules: 

1- '^A.A = ^7 — -^^ ^/ ihe diagram of A is obtained from that of X by adding a box to column 

q l{q I -1) 
1. 

2. m\ A = -^ — T7-^ if the diagram of A is obtained from that of A by adding a box to 

q^l-l 

column s > 1. 



Then the following formula holds: 



Mu,q{X) 



.r=l 



U , 



IAI-1 
7 J=0 



where the sum is over all directed paths 7 from, the empty partition to \, and the ji are the partitions 
along the path 7. Furthermore, if the Young tableau T corresponds to the path 7 then the probability 
that the Young Tableau Algorithm outputs T is equal to 



00 




|A|-1 


11(1 


--) 


n ^f.,ii+i- 


Lr=l 


1 J 


i=0 
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Now we can show that the Afhne Young Tableau Algorithm works. 

Theorem 6 The Affine Young Tableau Algorithm grows partitions distributed as N^^q- 

Proof: Let A'*' denote the the shape obtained by decreasing the size of column s of A by one and 
let Mu,q{^^^') = if A'^^ is not a partition. To prove the theorem it is sufficient to show that for 
|A|>0, 

^M„,,(A«) + E(^ - ^)M„,,(A(^)) = Ar„,,(A). 
Theorem shows that 



i3TM„,,(A«) + E(7Ajn - ^)^4,,(A(^ 



\l - '" 

9^^ S>1 

r=l y s>l^;0^A('')^A «=0 

where the inner sum is over all paths in the Young lattice from the empty partition to A''') to A and 
all other notation is as in Theorem H. Simplifying further and using Theorem again gives that 



u 






11(1 - ^)[^ Y. n "^7.7,+J = M^M = N^,q{X). 



1 Q u „ , ■ n 



D 



Remarks: 

1. Theorem |6| gives another proof that N^^q is a probability measure. 

2. From Theorem ^ one has an explicit formula for the chance that the Young Tableau Algorithm 



outputs a given Young tableau (page 565 of |F1|). Since the Affine Young Tableau Algorithm 



randomly adds on one additional box, one can write down a formula for the chance that it 
generates a given Young Tableau. 
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3. Two other probabilistic algorithms are given for growing partitions according to Mu „ on 



pages 581 and 585 of [Flj. These yield algorithms for sampling from Nu^q simply by adding 



an additional box as in Step 2 of the Affine Young Tableau Algorithm. 

Next we describe a rather surprising method for sampling from N^^q using Markov chains. We 
use the notation that (-), is equal to (1 — u/q) •••(! — u/q^). Proh.{E) will denote the probability 
of an event E under the measure N^ q- 



,T-roo (]^_„/„r\\^a-l 

Theorem 7 Starting with A'^ = a > 1 with probability Q{a) = lT^^,u^ /is > define in succes- 

<?" "(-)o(-)a-l 

sion A2, A3, • • • according to the rule that if A^ = a, then \[j^i = b with probability 

Then the resulting partition is distributed according to N^^q. 

Proof: The Nu^q probability of choosing a partition with A^ = rj for all i is 

Prob.iX, - r,) [[ p,,5.(^,=,^^...^^,=,^) • 
Theorem |l^ of Section Q shows that Prob.{X'i = ri) = Q{ri). Thus it is enough to prove that 

Prob.jXi = ri, • • • , A^_i = r,_i, A^ = a, A^^^ = b) _ n''(i),(f ), 

Prob.{X[ = ri, • . . , A^_i = r,_i, A^ = a) gf>^(i)^_,(l),(H), ' 

for all i > l,a, 6,ri, • • • ,r^_i. 

^^^u^u:K-j-,, +u^-j- \' ^ :-j- ;^ ^^^,,^J ;^ IIt7iI1 4-U^4- '~r/'^\ 



Letting T{a) be the Mu^q probability that A'^ = a it is proved in |F1| that T(a) — 2_ 



' q'"-^ q ' 



Next one calculates that for i > 2 






\0Jr1-r2 \„)ri-2-ri-i\o)ri-i-a 



'■l'-">-^i_l=''i-l ^ \q/'i-'2 \q/it-2-'i.-i\q 



Similarly, observe that 



y Nu q{X) = — -, ^ -^ T{b). 



^g/'i '^ \ q/ > %—z I % — i \ q^ 
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Thus the ratio of these two expressions is 

as desired. The case i = 1 must be checked separately but the same ratio results. □ 

The algorithm of Theorem [^ runs on a computer because of the well known fact that to sample 
from a discrete distribution one divides [0, 1] into intervals of the appropriate lengths and generates 
a uniform variable in [0, 1]. 

To conclude the section we explain how to sample from N^^q given that the size of the partition 
is n + 1, which is clearly the same as sampling the partition A^_i for a unipotent element of A{u, q). 
One naive method is to keep picking from N^^q until one obtains a partition of size n; however this 
is very slow and not theoretically useful. 

Algorithm for Sampling from N^^q given that |A| = n + 1 

Step Start with A^ = 1 and A the empty partition. 

Step 1 If n = then go to step 3. Otherwise set h = 1 j^. 

Step 2 Flip a coin with probability of heads h. 

Step 2a If the toss of Step 2 came up tails, increase the value of A^ by 1 and go to Step 2. 

Step 2b If the toss of Step 2 comes up heads, decrease the value of n by 1, increase A according 
to the rule of Step 2b of the Young Tableau Algorithm (which depends on A^), and then go 
to Step 1. 

Step 3 Perform Step 2 of the Affine Young Tableau Algorithm. 

Before proving Theorem IM we remark that it is an adaptian of an analogous result for GL{n, q) 



which was obtained in joint work with Mark Huber, surveyed in [FS] 
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Theorem 8 The algorithm for sampling from, Nu^q given that |A| = n + 1 is valid. 



Proof: The survey |F3| proved that if one replaced Step 3 in the above algorithm by stopping 
then one would sample from M^^q given that |A| = n. Let A*^*^ denote A with the size of column s 
decreased by one and let E{s) be the probability that Step 2 of the Affine Young Tableau Algorithm 
adds to column s of A'-'^^ The result follows because 



EA;|A|=n+l ^u,qW EA:|A|=n ^^u,qW 

E,M.,g(AW)i^(g) 

J2x:\X\=n^'Iu,qW 

Mu,q{^^'^) 



E^(^ 



Ex:\X\=nMu,qW 



D 



4 Applications of Probabilistic Algorithms 

This section consider applications of the probabilistic algorithms of Section y. The results here 



parallel those of |F1| for GL(n,q) but are genuinely different. We do omit the formula for the 
chance that an element of A(n, q) has a given characteristic polynomial, as this follows from the 
GL{n, q) result and offers no new insights. Furthermore we only state the results for A{n, q) as the 
extensions to P{n, q) are trivial. This section uses the notation that [u"]/(u) denotes the coefficient 
of u" in a polynomial f{u). 

Let PA^nik, q) be the chance that an element of A{n, q) has a k dimensional fixed space and let 



PA,oo{k, q) be the n ^ cx) limit of PA,n{k, q)- Theorem 10 will show that 



^-^'"^^'^^-|A(fc-l,g)| S qk^{q^-l)...{q-l 
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and 



PA,oo{k, q) = [11(1 - -)] ^^ _ ^^^^2 . . . (1 _ l/gfc-l)2(l _ 1/^fc) ' 



r=l 



where we use the notation that |^(— l,g)| = 0. Furthermore, a probabihstic interpretation will be 
given to the products in PA,oo{k,q)- The first step is to connect the chance that an element a of 
A{n, q) has a k dimensional fixed space with the partitions in the rational canonical form of a. 



Lemma 4 (kF^) The dimension of the fixed space of an element a ofGL{n, q) is equal to X';^_i i(a) 
(i.e. the number of parts of the partition corresponding to the polynomial z — 1 in the rational 
canonical form of a). 

To proceed further, some notation is necessary. Let T be a standard Young tableau with k 
parts. Let Tuj) be the entry in row i and column j of T. We define numbers /ii(T), • • • , hk{T) 
associated with T. Let hm{T) = T'(^^;^ j^) ~^(m,i) — lforl<m<A: — 1 and let hk(T) = \T\ —T^^i^^iy 
So if A; = 3 and T is the tableau 






[8j [9j 
then hi{T) = 2-1-1 = 0, /i2(T) = 8 - 2 - 1 = 5, and /i3(T) = 9-8 = 1. View T as being 
created by the Affine Young Tableau Algorithm. Then for 1 < m < k — 1, hm{T) is the number 
of boxes added to T after it becomes a tableau with m parts and before it becomes a tableau with 
m + 1 parts. hk[T) is the number of boxes added to T after it becomes a tableau with k parts. 
The proof of Theorem ^ will show that if one conditions T chosen from the measure N^^q on having 
k parts, then the random variables hi{T), • • • , hj.{T) are independent, where hi{T), • • • , hk(T) are 
geometric with parameters ^ ,•••,-% • 
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Theorem 9 



E ^-^^w 



n^-i nr=i(i-i^) 



\Mk-l,q)\Ur=i(.^-f) 
Proof: We sum over all Young tableaux T with k parts the chance that the Affine Young Tableau 
Algorithm outputs T. Recall from the second remark after Theorem ^ that one can compute the 
probability that the Affine Young Tableau Algorithm outputs any given Young tableau. The point 
is that one can in fact compute the probability that the Affine Young Tableau Algorithm produces 
a tableau T with given values of the /I's. 

To do this, consider what happens during Step 1 of the Affine Young Tableau Algorithm. 
Suppose that one moves along the Young lattice from a partition with m parts. Theorem ^ implies 
that the weight for adding to column 1 is mi- m+i_ii ; and that the sum of the weights for adding 
to some other column is -^ . Thus the chance that Step 1 of the Affine Young Tableau Algorithm 
yields a tableau with given values hi,- ■ ■ , ht-i and all other /ii=0 is 



oo k—1 k—1 

17(1 - -) — IT i—)^ 



In order for the Affine Young Tableau Algorithm to generate a partition with m parts, either 
Step 1 generates a partition with m — 1 parts and Step 2 increases the size of column 1 or else 
Step 1 generates a partition with m parts and Step 2 increases the size of column s > 1. Thus the 
probability that the Affine Young Tableau Algorithm generates a partition with k parts is 






Tli^-^r) 



.r=l 
oo 



W 



fc-l 



A;— 1 oo 



U 



nd-^) 



.r=l 



\GL{k-l,q 
w 



n E & 

m=l hm=0 ^ 
fc-l k-1 oo 






^)] 



q>'-^\GL{k-l,q)\ 



n j:0'-iT.&] 



m=l hm=0 



hk>0 
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,k-l 



OO /-] u 



nr=i(i 



\Mk-l,q)\Ur=i{^-^y 



a 



One further lemma will be used. 



Lemma 5 d GoR j) 



OO OO 



OO / \1 

n(i-^) = y t^ 



r=l ^ i=0 

Now the formulas for PA^n{k,q) and PA,oo{k,q) can be proved. Recall that we use the notation 

that \A{-l,q)\ =0. 
Theorem 10 1. 

-. n-k+l I ^\i 



PAn(fe,g)= E 



KA;- 1,9)1 ^ gH(g*_i)...(g_i) 



OO Y U/q) 

PA,Uk,q) = [Jlil - 7)](i_i/,)2...(i_i/,.-i)2(i_i/^.)- 

Proof: In the equation of part one of Corollary |l| set x^-i^x = 1 if A has k parts and x^-i^x = 
otherwise. Also set x^^\ = 1 for (/) j^ z — 1. It follows from Theorem ^ and Lemma ^ that 

OO 1 

PAAk,q) = MIIIK ^ ..e.(.) ) E ^"..W 



0^2 r=l J^ qr-deg{4>) \:\'^=k 



A:A'j=A: 



(l-n)|A(A;-l,g)| 

1 - u ^ (o* - 1) • • • (g - 1) 



1 ..n-fc+l. 1 ^ (-1)^^'/-'=)^ 



|^(A:-l,g)r ^l-u^(g*-l)---(g 

1 ";^^ (-1)^ 
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For the second assertion of the theorem use Lemma ^ and Theorem ^ to conclude that 

PA,oo{k,q) = limn^oo[u"'h Yl ^n,qW 

1 n.~i(i-^) 

|A(/c-l,g)|nti(l-^) 
as desired. □ 



We remark that the analog of Theorem 10 for GL{n,q) was first proved by Rudavlis and 



Shinoda |RS|, using Moebius inversion on the lattice of subspaces of a vector space. Theorem 10 
can be proved along the same lines but we omit the details as the argument is less insightful and 
incongruous with the theme of this paper. 

The final result in this section is an enumeration of unipotent elements in A{n, q) of a given 
rank. 

Theorem 11 The number of unipotent elements of rank k in A{n,q) is equal to 

\A{n,q)\ (1 - l/gfc) . . . (1 - l/g>^) 

\A{k -l,q)\ g"-'=+l(l - 1/q) • • • (1 - l/q^-k+^) ' 

Proof: In the equation of part 2 of Corollary 111 set Xz-i,x = 1 if A has k parts and x^^\ = 
otherwise. Using Theorem one sees that the sought number is 



|A(n,g)|K 



k 



|^(fc-l,g)|nti(l-^) 



\M'n,q)\ rn-fc+ln-TT 1 

\A{k-l,q)r ^^1-^ 

\A{n,q)\ (1 - l/gA.) . . . (1 - l/g») 

\A{k -l,q)\ g"-*^+i(l - l/q) • • • (1 - 1/^"-'=+!) ' 

D 
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5 Applications of Cycle Indices 

The purpose of this section is to apply the cycle index generating functions of Section |^ to obtain 
precise estimates for the probabilities that an element of A{n, q) is separable, cyclic, or semisimple 
(these terms will be defined as needed). As these probabilities are identical for P{n,q) results will 
only be stated for A{n, q). The section is organized by discussing separable, cyclic, and semisimple 
probabilities, and in that order. 

The use of generating function methods to provide similar estimates for the finite classical groups 



has appeared in earlier papers (the paper [F5] computes all three limits for GL, [W| computes the 



separable and cyclic limits and bounds the convergence rates, and |FNP| obtains convergence rates 



for the semisimple case and extensions to the finite classical groups). The reader unfamiliar with 
the cycle index of GL{n,q) may wish to consult [Q or perhaps p^ or [F5] which give worked 
examples in our notation. 

The contribution of this section is the not obvious fact that cycle index methods can be applied 
to P{n,q), a maximal parabolic subgroup of GL(n,q). It is also interesting that the n ^ oo 
limiting separable, cyclic, and semisimple probabilities are all different from the corresponding 
limits in GL(n,q). 

The following notation will be used throughout this section. N{d, q) will denote the number of 

monic degree d irreducible polynomials over Fg. N (d, q) is defined by N (d, q) = N{d, q) for d > 1 

and N'{l,q) = N{l,q) — 2. The following elementary lemmas will be useful. 
Lemma 6 

Proof: Expanding h^i^ ^^ ^ geometric series and using unique factorization in -Fg[x], one sees 

that the coefficient of u in the reciprocal of the left hand side is -j times the number of monic 
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polynomials of degree d, hence 1. Comparing with the reciprocal of the right hand side completes 
the proof. □ 

Lemma 7 (Darboux ^) Suppose that f{u) is analytic for \u\ < r,r > and has only simple poles 
on \u\ = r. Letting Wj denote the poles, and gj{u) he such that f{u) = i^^Jl^ and gj{u) is analytic 
near Wj, one has that as n ^> cx), the coefficient of vP' in f{u) is 

5.1 Separable Matrices 

An element a in A{n, q) is called separable if its characteristic polynomial is square free. We remark 
that the results of this subsection (but not those of the cyclic and semisimple subsections) could 
also by obtained by using the cycle index for GL(n,q). 

Let s^in, q) be the proportion of separable elements in A{n, q) and define the generating function 
SAiu,Q) = J2'^=o''^^SA{'n',q) and let Sgl{u-,q) be the corresponding generating function for GL. 

Theorem 12 

^ SGL{u,q) 
(1 + i^) 

Proof: The first equality follows from Corollary || together with the fact that an element a of 

GL{n,q) is separable if and only if all Xfj,{a) have size at most 1. The second equality follows from 

the cycle index for GL{n, q). □ 



Wall [Q shows that SGL{u,q) is analytic in \u\ < q except for a simple pole at u = 1 which 
has residue 1 — 1/q, implying that sgl{oo, q) = 1 — -. To compute the corresponding limit for the 
affine case note that for q > 2, Sa{u, q) is analytic in \u\ < q — 1 except for a simple pole at u = 1 
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1-i 
which has residue j . For q = 2, since N'{l,q) = the function SA{u,q) is analytic in |n| < q 

+ 9-1 

1_1 1-1 

except for a simple pole at n = 1 which has residue ^ , I ■ Thus SA(oo,g) = _, , f . 

^+9-1 -"-+9-1 



Theorem 13 bounds the convergence rate of SA{n, q) to its limit. The statement of the theorem 



is not fully simplified so as to make the proof easier to follow. 
Theorem 13 Let c = 3/2 and K+ = -0^(1 + ^)(^). Then 

1. \sA{n,q)-SA{oo,q)\ < ^Jl^^tj-%%) + (i-i/(g-i))(g-i)n+i forq> 2. 

2. \sAin, q) - SAioo, q)\ < "^^^^If^yi for q = 2. 

Proof: Theorem |^ imphes that 

(1 - u)SAiu, q) = —-—^--Sgl{u, q). 

V^ + q-l) 

Taking coefficients of w™+^ on both sides and using the fact that sgl{^, q) = sgl{0, q) one obtains 
that 



-, m -. 

\sA{m+l,q)-SAim,q)\<{^)"^[Y^{q-iy\sGLii + hq)-SGLii,q)\] + ^—^y^. 



For any c, K~^ as above, page 273 of [W| gives the bound 

\sGL{i,q) - SGL{oo,q)\ <K+{c/q)\ 
Thus by the triangle inequality, 

\sGL{i + 1, g) - SGL(i, q)\ < 2K+{c/q)\ 
Now summing over m > n and using the triangle inequality gives 



1 "^ — 1 1 

\sA{n,q)-SAi^,q)\ < 2 ^ (— y r[E(^)^^^^i + ^ (a - 1)^+^ 
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^ 2K c ^ / c ^^ 1 

2K+c{c/{q - 1))" 1 



(c-l)(l-c/(g-l)) (l-l/(g-l))(g-l)"+l- 

For q = 2 observe that taking coefficients of u™"'"^ on botli sides of 

(1 - u^)SAiu, q) = {I- u)Sgl{u, q) 

gives that 

\sA{'m + 2,q) - SAim,q)\ < \sGLim + 2,q) - SGL{m+ l,q)\. 

Now use Wah's bound for \sGL{i-,q) — sgl(oo,(?)| to conclude that 



\sA{n,q) - SA{oo,q)\ < ^ \sA{m + 2,q) - SA{rn,q)\ 

m — n even 

< Yl \sGL{m + 2,q) - SGL{m+l,q)\ 

Tn>n 
m — n even 

< 2 Y. K+{c/qr+^ 

m>n 
m — n even 

2K+(c/g)"+i 



< 



1 - {c/qf 
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5.2 Cyclic matrices 

An element a of GL(n, q) is called cyclic if its characteristic polynomial is equal to its minimal 
polynomial. Let CA{n,q) be the proportion of cyclic elements in A{n,q) and let CAioo,q) be the 
n ^ oo limit of c^(n, g). Let C^(u, g') = X]^o^"'^^(^''?) ^^^ ^^^ Cgl{'U'-,q) be the corresponding 
generating function for GL. 
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Theorem 14 



-CGL{u,q). 



l-u/ql+ (,,i)(^„/,) 

Proof: The first equahty follows from Corollary || together with the fact that an element a of 
GL{n, q) is cyclic if and only if all X^{a) have at most 1 part. The second equality follows from the 
cycle index for GL{n,q). □ 

Corollary 2 

1 - I 1 - 1/q^ 



CA{oo,q) 



^-l + h^ + Vi'' 



Proof: Wall |W| shows that CGL{u,q) is analytic in \u\ < q except for a simple pole at n = 1 



which has residue , ,('^3 . Theorem 14 implies that Cyi(ii, g) is analytic in \u\ < q^ — q except for a 



1+1/g^ 



. L_i T^rmo r> * ( rv^ /nr \ ic omitil "Ir^ 2 L_i_ 



simple pole at u = 1 which has residue - — i ^ i 111 /a • Thus ca(oo, g) is equal to - — 1 ,^ 1 111 /a • '^ 



9 q^ q q^ 



For large g the limit in Theorem |2| is of the form 1 — 1/g^ + 0{l/q^). It would be interesting 
to understand this in terms of algebraic geometry; the 1 — 1/q^ + 0{q'^) behavior in the case of 
GL(n,q) is a finite analog of Steinberg's result that the variety of regular semisimple elements has 



codimension three IS^]. See [ N"P3 | for further discussion. 



Adapting a trick of Wall | W] from the GL case gives instant bounds on the convergence rate of 



CA{n,q) to its limit. 
Lemma 8 



{l-u)GA{u,q) = {l--)SA{-, 

q q 



Proof: Theorems 14, 12 and Lemma y imply that 



4 



i^-u)CAiu,q) ^ ^ rid + .,, _ ,.,-., _ , , .x '^---) rid - n-/.r^---) 



V'z^Vi 9^(i-tiW)(i-i/9'')- 



d>l 
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,d 



^ \N'{q,d) 



<'-"/'>n(i + ;rt;?3T)) 



d>l 



11 

{l-u/q)SA{-,q)- 



u 



Corollary 3 

\cA{oo,q)-CA{n,q)\< ^„+i^^ _ -^^^^ 
Proof: Taking coefficients of u"*"^^ on both sides of the equation in Lemma ^ gives that 

CA(m + l,g) -CA{m,q) = -^^^[sA{m + l,q) - SA{m,q)]. 
Using the triangle inequality and the fact that < SA{m + 1, g), SA{m, q) < 1, it follows that 



\cA{oo,q) -CA{n,q)\ < ^ \cA{m + l,q) - CA{m,q)\ 



m=n 

oo 



J2 t;;ittI'^^("^ + ^'^) ~*^("^'^)l 



m=n ^ 



oo 1 

— Z—t r,m- 
1 



< Qm+1 
m=n ^ 



g"+i(l-l/g)- 
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5.3 Semisimple matrices 

An element a of GL{n, q) is called semisimple if it diagonalizable over the algebraic closure of 



Fq. Treatments of semisimple probabilities without generating functions appear in |IsKanSp| and 
iGuL ], which proves the lovely result that if G is a simple Chevalley group, then the probability of 
not being semisimple is at most 3/(g — 1) + 2/(g — 1)-^. 
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A crude asymptotic understanding of the behavior of the proportion of semisimple elements 
in GL{n,q) is in [Q, who used generating functions. The paper |F5] uses one of the Rogers- 



Ramanujan identities to show that n ^ oo probabihty that an element of GL{n, q) is semisimple 



IS 



r = 0,±2(mod 5) 

The paper |FNP| gives bounds for finite n. 



T- = l V-^ qr J 



Let ssA{n, q) be the probability that an element of A{n, q) is semisimple and let ssa{oo, q) be the 
n — > oo limit of ssA{n.,q). Let SSa{u,q) = Z]5?^o^"'5*^("-'9) ^^^ ^^^ SSgl be the corresponding 
generating function for GL. 

Theorem 15 

k kd 

SSAiu,q) = (E ^^TT^) IKE ^^r^''^^ 



(Efc>0 g.^+" (1), 



-SSgl{u,' 



Proof: An element a of GL(n, q) is semisimple if and only if all A(^(a) have at most one column. 
Now use Corollary |^. □ 

To calculate ssAioo,q) we will use (with q replaced by 1/q) the well known Roger s-Ramanuj an 

identities (see Q for discussion) 

oo „2 oo ^ 

oo n{n+l) oo ^ 

^ ^ 5 (1 - g)(l - 9^) • • • (1 - q-) = n (1 _ ^5n-2)(l _ ^5„-3) ■ 

Corollary 4 

n°° .=1 (i-i/9on°" .=1 (i-i/g^-') 

/ \ r=0,±l mod 5 r— 0,±2 mod 5 

'^^^°°''^^ = u^^ir^T^W? 

r = 0,±2 mod 5 
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Proof: The discussion in |FNP| shows that SScLiuiQ) is analytic within a circle of radius greater 



than 1, except for a simple pole at u = 1. Since 



(Efc>o g..+" (i) J 



^1' 



^^^^o?^(Ik- 



^9' 



is also analytic within a circle of radius greater than 1, it follows that 

ssA{oo,q) = — — ^ " ^ ssGL{oo,q)- 



Now simply use both Rogers-Ramanujan identities and the formula for ssGL{oo,q) in |F5| stated 
at the beginning of this subsection. □ 

Bounding the convergence rate of ssAin, q) through generating functions is an involved analytic 

excursion which we omit. The following elementary bound is sufficient for practical purposes, given 

the results of the previous subsections and the fact that ssA{n,q) can be computed explicitly for 

small n from the generating function. 
Theorem 16 

SAin, q) < ssA{n, q) < SA^n, q) + {1 - CA{n, q)). 

Proof: The first inequality follows because semisimple matrices are separable. The second in- 
equality follows because a matrix which is not separable is either not cyclic or not semisimple. □ 
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